Extreme File Inversion

نویسنده

  • Shlomo Geva
چکیده

In this paper we describe the implementation of an extreme variation to the inverted file scheme. The scheme supports a comprehensive set of Boolean search operators, down to the single character level. When combined with a heuristic document ranking algorithm it supports retrieval of raw XML data, using the embedded tags as search arguments. We tested the system against a set of XML queries and the entire set of IEEE Computer Society publications 1995-2002, in XML format.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Design and Implementation of the Inversion File System

This paper describes the design, implementation, and performance of the Inversion file system. Inversion provides a rich set of services to file system users, and manages a large tertiary data store. Inversion is built on top of the POSTGRES database system, and takes advantage of low-level DBMS services to provide transaction protection, fine-grained time travel, and fast crash recovery for us...

متن کامل

Controllability of dynamic double helices: quantitative analysis of the inversion of a screw-sense preference upon complexation† †Electronic supplementary information (ESI) available: NMR, UV and CD spectroscopic data, energy-minimized structures and experimental details of new compound preparation. CCDC 1404043. For ESI and crystallographic data in CIF or other electronic format see DOI: 10.1039/c5sc02614h Click here for additional data file. Click here for additional data file.

We describe a quantitative analysis of the complexation-induced inversion of a screw-sense preference based on a conformationally dynamic double-helix structure in a macrocycle. The macrocycle is composed of two twisting units (terephthalamide), which are spaced by two strands (1,3-bis(phenylethynyl)benzene), and is designed to generate a double-helix structure through twisting about a C2 axis ...

متن کامل

Extreme regression.

We develop a new method for describing patient characteristics associated with extreme good or poor outcome. We address the problem with a regression model composed of extrema (maximum and minimum) functions of the predictor variables. This class of models allows for simple regression function inversion and results in level sets of the regression function which can be expressed as interpretable...

متن کامل

Improving Virtual Hardware Interfaces a Dissertation Submitted to the Department of Computer Science and the Committee on Graduate Studies of Stanford University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy

Since the origin of virtual machine monitors in the 1960s, virtual hardware has often been designed with “impure” interfaces different from physical hardware interfaces. Paravirtualization, as this is now termed, is often used to simplify VMMs and boost VM performance. This thesis explores tradeoffs in a rarely seen form of paravirtual interface, where the virtual interface operates at a higher...

متن کامل

S-Index: a Hybrid Structure for Text Retrieval

Today, two classes of indexing methods enjoying wide applicability are the Inverted Index and the Superimposed Coding based Signature File (SC-SF). The former is most efficient in query processing but utilizes extra storage of size comparable to that of the textbase, whereas the latter is most efficient in storage utilization. The present study builds upon the results obtained in previous resea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002